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ABSTRACT 

Given the failure of existing models for redshift-space distortions to provide a highly 
accurate measure of the /^-parameter, and the ability of forthcoming surveys to obtain 
data with very low random errors, it becomes necessary to develop better models for 
these distortions. Here we review the failures of the commonly- used velocity dispersion 
models and present an empirical method for extracting f3 from the quadrupole statistic 
that has little systematic offset over a wide range of j3 and cosmologies. This empirical 
model is then applied to an ensemble of mock 2dF southern strip surveys, to illustrate 
the technique and see how accurately we can recover the true value of [3. We compare 
this treatment with the error we expect to find due only to the finite volume of 
the survey. We find that non-linear effects reduce the range of scales over which (3 
can be fitted, and introduce covariances between nearby modes in addition to those 
introduced by the convolution with the survey window function. The result is that 
we are only able to constrain to a 1-er accuracy of 25% {j3 = 0.55 ± 0.14 for the 
cosmological model considered). We explore one possible means of reducing this error, 
that of cluster collapse, and show that accurate application of this method can greatly 
reduce the effect of non-linearities, improving the determination of (3. We conclude 
by demonstrating that, when the contaminating effects of clusters are dealt with, 
this simple analysis of the full 2dF survey yields (3 = 0.55 ± 0.04. For this model 
this represents a determination of (3 to an accuracy of 8% and hence an important 
constraint on the cosmological density parameter Qq . 

Key words: cosmology: theory - large-scale structure of Universe - galaxies: dis- 
tances and redshifts. 



1 INTRODUCTION 

The new generation of galaxy redshi ft surveys, re presented 
by the 2dF galaxy redshift surv ey (Colless 1995 ) and the 
Sloan digital sky survey (SDSS, Gunn & Weinberg 1995), 



will be capable of measuring the power spectrum and corre- 
lation function of galaxy clustering with an unprecedented 
degree of accuracy. One way of using this information to con- 
strain the fundamental cosmological parameters is through 
the redshift-space distortions of large-scale galaxy cluster- 
ing. Since the radial coordinate of a galaxy's position is 
measured by translating its redshift to a distance via the 
Hubble law, the inferred distance is prone to contamination 
by deviations from the uniform expansion of the Universe. 
This contamination is a systematic effect, and produces an 
anisotropic signal in the galaxy correlation function and 
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power spectrum. The strength of this anisotropy depends 
on the mass density of the Universe through the parameter 
P = fl°' 6 /&, where 6 is the bias factor, relating the amplitude 
of fluc tuations in t he galaxy density to those in underlying 
mass (Kaiser 1987). 



In order to fully exploit the high quality data that will 
soon be available, it is necessary to employ theoretical mod- 
els that ar e at least as accurat e as the data itself. In previ- 
ous work (Hatton & Cole 1998, hereafter HC98) we demon- 
strated that two methods often used to model the redshift- 
space distortions in the quasi-linear regime were not able to 
deal robustly with a wide range of cosmological parameters. 
Their use, when applied to a high-quality dataset, could lead 
to significant bias in determination of the mass density of 
the Universe. 

In this paper we use a la rge set of iV-body simulations, 
described in Cole et al. 1998 (hereafter CHWF), spanning a 
range of cosmologies and biased in a variety of ways, to show 
that none of the velocity dispersion models commonly con- 
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sidered is capable of estimating f3 without substantial bias. 
In section 2 we present an empirical model, based on results 
from these simulations, that provides an accurate way of 
extracting j3 from redshift-space distortion information over 
the full range of cosmologies used. 

In section |^, we address the question of how well the new 
galaxy redshift surveys will measure the distortion parame- 
ter (3. We use our empirical model from section ^ to fit the 
results of analysing mock 2dF redshift surveys, whose con- 
struction is described in CHWF. Repeating this procedure 
for several independent simulations of the same cosmology 
we estimate the scatter in estimated /3 values and hence the 
accuracy to which we will be able to measure j3. We com- 
pare this accuracy to that which we would expect to find if 
the density field were described by a Gaussian random field, 
and the only source of correlation between modes came from 
the survey window function. We show that this approxima- 
tion results in a substantially smaller value for the error 
expected on (3. We attribute this discrepancy to the pres- 
ence of extremely non-linear regions in the mock catalogues 
which cause coupling between modes. 

In section M we employ the technique of cluster col- 
lapse to effectively dampen the non-linearities present in 
non-linear regions. This method greatly reduces the error 
on /3, by extending the range over which our models are 
valid, and by removing some of the mode coupling. 

In section |B| we estimate the tightest possible constraint 
that we will be able to obtain from application of these meth- 
ods to the 2dF survey, by using the the full, magnitude lim- 
ited sample, rather than a volume limited subsample. We 
summarise and conclude in section ^| 



2 MODELLING THE REDSHIFT-SPACE 
DISTORTIONS 

In HC98 we considered two extensions to the linear theory 
of reds hift-space distortions: the Zel'dovich approximation. 
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Figure 1. The scaled quadrupole-to-monopole ratio for all the 
simulations and bias models described in CHWF. The y-axis is 
scaled to the expected, linear theory value of Q (equation 2.1), 
the x-axis is scaled to the zero-crossing scale of the quaHrupole, 
fe n i. The thick black line is our empirical fit, equation |2.2|. 



the simulations by selecting particles as galaxies depending 
on their local density, so as to produce a clus tering ampli - 



tude matching that seen in t he APM survey (Maddox, Ef- 



which 4as found to break down in the case of biased galaxy stathiou, fc Sutherland 199q ) . This bias is achieved using a 



distributions, and the exponential model of velocity disper- 
sions, which seemed to work well only in the limit that the 
velocity dispersions were high. The failure of these existing 
models to cope with a full range of cosmologies and biases 
is what motivates us to find a more accurate and robust 
model. 

Here we use a set of iV-body simulations spanning a 
broad range of parameter space, with a number of different 
biasing schemes, to look for an empirical function that will 
accurately model the strength of the quadrupole distortion 
of the galaxy power spectrum. We assume that an estima- 
tor which can accurately and robustly measure (3 from this 
broad range of simulations should be able to do the same 
when applied to a real galaxy sample. These simulations are 
exactly those described extensively in CHWF, but we will 
summarise their details here. We include both open (Ao = 0) 
and flat (fio + Ao = 1) cosmologies, with power spectra 
given by variants of the CDM spectrum, parameterised in 
Bardeen et al. 198€| . The amplitudes are set according to 
two methods, either so as to reproduce the local abundance 
of rich galaxy clusters (White, Efstathiou, & Frenk 1993 



range of different prescriptions, as described in CHWF, sec- 
tion 3.3. These prescriptions include high-peaks, power-law 
and sharp-threshold bias models. The simulations are run 
with 192 3 particles, on a grid of length 345.6ft -1 Mpc. We 
select 128 3 of these particles as galaxies using the biasing 
algorithm. 

We expect the redshift-space distortions to tend asymp- 
totically towards linear behaviour on the largest scales. 
The statistic we use to parametrise the distortions is the 
quadrupole-to-monopole ratio, Q(k) = P%(k)/ Po(k), where 
Po(k) and Piik) represent the monopole and quadrupole 
terms of the 3-dimensional redshift-space galaxy power spec- 
trum P(k). In the linear regime, a positive value of Q(k) is 
produced by coherent infall onto overdense regions and out- 
flow from underdense r egions. The distortion, computed first 
by Kaiser (Kaiser 1987), depends only on /3 and the resulting 
expression for the quadrupole-to-monopole ratio is 



4/3/3 + 4/377 
1 + 2/3/3 + /3 2 /5 



(2.1) 



(eg. Cole, Fisher, & Weinberg 1994). Most physical models 



Eke, Cole, fc Frenk 1996 ) , or to matc h the large-scale COBE 
observations (White fc Bunn 1995). We introduce bias to 



of local galaxy bias result in a bias function, b, that, despite 
being scale dependent on small scales, tends to a constant 
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value on large scales (Coles 1993; Mann, Peacock, & Heav- 
ens 199^j) . Thus, (3 itself is expected to be constant in this 
regime, and, as a starting point, any model of Q(k) should 
have this asymptotic behaviour on large scales. 

On smaller scales the quadrupole-to-monopole ratio is 
suppressed from its linear value. Eventually, Q(k) becomes 
negative, where the random motions inside highly non-linear 
overdense regions produce structures elongated along the 
redshift direction (fingers of God). Most of the information 
that can be used to constrain (3 comes from the quasi-linear 
regime, where Q(k) is suppressed but still positive. We there- 
fore seek a model for Q(k) over just this range of scales. 

We measure Q(k) from the simulations using the tech- 
nique of fast Fourier transforms, presented in HC98. We first 
simulate redshift-space anisotropy by picking one of the co- 
ordinate axes of the simulation as a redshift direction, and 
perturbing all the particle positions by their velocities in 
this direction. As noted in HC98, using a single line of sight 
for the simulation ensures that the distant observer approx- 
imation holds, by effectively placing the simulation at an 
infinite distance. The galaxy density field, S(r), is Fourier 
transformed to obtain 5(k), and P(k) = (<5(k) 2 } is taken 
as the estimate of the power spectrum at each grid point 
in fc-space. This three-dimensional, anisotropic power spec- 
trum is then decomposed into spherical harmonics to obtain 
the monopole (or spherically averaged power), Po{k), and 
quadrupole, P2(k). 

Fig. |l| shows the quadrupole-to-monopole ratio, Q(k) = 
Pi(k) I 'Po(fe), for all the galaxy distributions resulting from 
each cosmological model and local biasing prescription inves- 
tigated in CHWF. We have scaled the curves such that the 
x-axis is measured in units of the wavenumber, fc n l, where 
the quadrupole crosses zero, and the j/-axis is s caled by the 
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expected, linear theory value given by equation 2.1. 

At low k, there is substantial scatter between the curves, 
and a turn down from the linear value in the first two bins. 
This feature was explained in CHWF as resulting from a ran- 
dom down-turn in power on these scales in the simulations 
we used: the majority of the simulations are based on the 
same initial random phases, so this down-turn is repeated 
in all the simulations. 

Apart from this trend, it can clearly be seen that the 
curves generally have a common locus once they have been 
scaled in this way. It is thus reasonable to seek a model for 
Q(k) of the form Q(k) = Qu n f(k/k n i). A good empirical fit 
of this brm is provided by 



f(x) = l-x 122 , (2.2) 

which is shown by the heavy solid line in Fig. |l|. 

We have thus found a model with two free parame- 
ters, (3 and fc n i, which accurately describes the shape of the 
quadrupole-to-monopole ratio, Q(k), for the wide range of 
models considered in CHWF. 



Th is is rath er similar to the approach used by Fisher 
Nusser (1996) who fit an empirical curve to their an- 



alytic Zel'dovich approximations for the quadrupole using 
the same scalings. We stress that our approach should be 
far more general since it is based on the fully non-linear 
data provided by JV-body simulations, covers a much broader 
range of cosmologies, and includes the effects of bias. 
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Figure 2. Scatter plots for two realizations of the f2o = 1, tCDM, 
simulations (named E3SA and E3SB in CHWF), biased with each 
of the six biasing prescriptions described in CHWF. The point 
types reflect different bias models: model 1 (stars), 2 (crosses), 
3 (circles), 4 (open squares), 5 (solid triangles), 6 (open trian- 
gles). On the left we show comparisons of the fit f) with the input 
(3 = Q°-®/b, where the bias factor, b, is defined by the ratio of 
the fluctuations in spheres of radius 8/i _1 Mpc in the galaxy and 
matter distributions. On the right, the bias factor is instead de- 
fined on a larger scale, as the ratio of fluctuations in spheres of 
radius 20h~ 1 Mpc. 



2.1 Definition of bias 

The conventional method for specifying the bias factor, b, 
is by the ratio of rms fluctuations in spheres of radius 
8/i -1 Mpc in the galaxy density to those in the mass density. 
In the approximation that both power spectra have the same 
shape, or at least the same shape on scales that contribute 
to the fluctuations in 8/i -1 Mpc spheres, this is equivalent 
to a boost in the galaxy power spectrum normalization by a 
factor b 2 relative to the mass spectrum. Most physical pre- 
scriptions for galaxy formation, in contr ast, result in a bias 
that is to some extent scale dependent ( Man£^l^eacock^& 



Heavens 199£; Benson et al. 19991; Blanton et al. 1999) 



For example, the large-scale galaxy distribution may have a 
constant bias, but this may be reduced in clusters where the 
number density is sufficiently high that a significant amount 
of merging has occurred. 

In Fig. |2| we show the result of fitting our empirical 
model to the two simulations for which the widest range 
of biasing schemes is available. These rCDM simulations 
(named E3SA and E3SB in CHWF) have been biased using 
all six prescriptions described in CHWF. The left hand pan- 
els show the scatter of [3m versus [3g = ^o' 6 /bg. The diagonal 
line represents the ideal, one-to-one correspondence between 
the estimated and true values of (3. The labels at the bot- 
tom of each panel give the average difference between the 
fitted and true values, and also the scatter around the mean 
relation. As can be seen, for any one simulation, most of the 
biasing prescriptions result in the same values of f3s; not re- 
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ally surprising, since most of them were fixed to reproduce 
clustering on these scales. Different prescriptions for biasing 
the galaxy catalogues, however, result in systematically dif- 
ferent best-fitting values of (3, despite the constraint that f3& 
is the same. This produces significant scatter in the values 
of /3fi t for these different biases. 

In contrast, the scatter is much reduced in the right 
hand panels in which we plot /3g t against P20, where P20 is 
obtained using the bias factor, 620, defined by the ratio of 
galaxy to mass fluctuations in spheres of radius 20/i -1 Mpc. 
Ideally we would like to define the bias parameter on very 
large scales which would truly reflect the asymptotic value 
of the bias, but in practice the finite volume of the simula- 
tions makes estimates of the bias on such scales rather noisy. 
CHWF presented a method for calculating analytically the 
asymptotic value of the bias, but this method can only be 
applied to the subset of bias models that are based on the 
initial linear density field rather than the evolved non-linear 
density field. We expect 620 to be a good approximation to 
the asymptotic large scale bias in all cases. 

Use of P20 rather than /3s has a dramatic effect on the 
scatter of the different models, pulling them much closer 
to the j3 — (3m line. The scatter is reduced by a factor 
of four in these models. This tighter correlation indicates 
that the quadrupole, despite being measured over a range of 
mostly quasi-linear wavenumbers, constrains quite directly 
the large-scale linear bias factor. 
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2.2 Comparison with existing models 

We now compare our two-parameter model for Q(k) with 
the analytic velocity dispersion models which are commonly 
used in the analysis of redshift-space distortions. In fact, 
three models of velocity dispersion have generally been con- 
sidered in the literature: 

(i) Gaussian. Particle velocities are drawn from the dis- 
tribution V(v) oc exp(— v 2 /2a 2 ). In this case the convolution 
in real space results in a multiplication of 5(k) by the factor 
exp(— k 2 [i 2 a 2 /2), where fj, is the cosine of the angle between 
the wave- vector, k, and the line of sight. 

(ii) Exponential. The particle velocities have V(v) oc 
exp(— \/2\v\/ a v ). Then, <5(k) is multiplied by (I + 
k 2 i?o 2 j2)-\ 

(iii) Pairwise exponential. In this case, the pairwise ve- 
locity dispersion of the particles is assumed to come from an 
exponential distribution like that of the exponential model. 
The multiplicative factor is the square root of that in the 
previous case, and so the power spectrum itself (|5(k)| 2 ) is 
multiplied by (1 + k 2 f i 2 a 2 /2)~ 1 . 

The value of a v that appears in the pairwise formula is the 
pairwise velocity dispersion, equal to \/2 times the point- 
wise dispersion. Thus, if we Taylor expand each of the three 
factors, we find that the first order term is the same in each 
case; the distributions have the same width, but different 
shapes. 



As pointed out by Peacock & Dodds (1994), this effect 



cannot be considered in isolation; the linear Kaiser boost 
to the power spectrum also contains terms in /i, and so to 
extract the harmonic moments, Po(k) and P2{k), one must 
first multiply these two factors together before weighting 
by the appropriate Legendre polynomial and averaging over 



Figure 3. Comparison of our empirical model for Q(k) with 
results from a Gaussian, exponential, and pairwise exponential 
model. These models were computed for (3 = 0.55 and a one- 
dimensional point- wise velocity dispersion of 400km s . 



the line-of-sight angle. We use the maple computer pack- 
age to perform these calculations. In Fig. ^ we compare the 
three different velocity dispersion model predictions for the 
shape of Q(k) with the shape we obtain for our empirical 
model, which we know (from Fig. ^) is a reasonable fit to 
the simulations. It is clear that the shapes of the velocity dis- 
persion models do not match that of the empirical model, 
and that these models always underestimate the level of de- 
viation from the linear theory result on large scales. This 
breakdown is due to the violation of the assumption that 
non-linear behaviour arises purely through thermal motions 
in virialized systems at a fixed velocity dispersion. In fact, 
the velocity dispersion will be correlated with the density 
field, and coherent non-linear flows may play an important 
part at large scales. 

It is clear that using the velocity dispersion models to 
measure /3 will result in some degree of bias, since the curves 
are not a good fit to the ./V-body data. In order to assess the 
significance of this bias we use the FFT method of estimat- 
ing the redshift-space power spectrum and its quadrupole- 
to-monopole ratio from the simulation cubes, as outlined 
above and described in HC98, for the full set of simulations 
described in CHWF. 

In Fig. ^ we show a scatter plot for the behaviour of Put 
with the known value of P20, as defined above. Each panel is 
labelled with the dispersion model used to make the fit, and 
the figure includes every biasing scheme of every cosmologi- 
cal model presented in CHWF. It is immediately clear from 
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Figure 4. Scatter plots of /3g t for a sample of over one hun- 
dred simulations. Each panel shows the results of using a differ- 
ent redshift-space distortion model to extract the value of /3g t , as 
indicated by the name in the top left corner. We also quantify the 
systematic offset, (/3 — /3true), and the scatter, a, about the mean 
value. 



this figure that the velocity dispersion models all produce 
a systematic offset between the fitted and true values of 0. 
The pairwise exponential model comes closest to fitting the 
data, but this still results in a systematic underestimate of 
P by ~ 0.1. Judging fro m a number of recent results (for a 
summary, see table 1 of Hamilton 1998) ) , we expect to find 
P ~ 0.5, so this represents a 20% bias in measuring /3, cer- 
tainly rather larger than the random errors expected from 
large surveys like 2dF and SDSS. The estimator based on 
our empirical model, shown in the bottom right-hand panel, 
has similar scatter to the other estimators but there is no 
appreciable systematic offset over a wide range of (5. The 
only regime where the model appears to break down is for 
< 0.3, which is at the very low end of the range allowed 
by current observations. 



2.3 Intrinsic scatter 

In addition to the original two rCDM simulations (E3SA 
and E3SB of CHWF), we now have a further eight sim- 
ulations of the same cosmological model. All ten of these 
realizations have been biased with one of the bias prescrip- 
tions. The scatter in the resultant 0at values of a = 0.03 
is an indication of the limiting accuracy to which can be 
determined due to the finite volume of these simulations. 
This result indicates that the offsets in {0 — /3truo) seen for 
the empirical model fits in Figs. ^ and ^ are statistically 
insignificant. 



3 ANALYSIS OF REDSHIFT SURVEYS 

Having demonstrated that our empirical model for the 
quadrupole Q(k) can be used to make essentially unbiased 



estimates of the redshift-space distortion parameter 0, we 
now turn to the question of how accurately can be mea- 
sured from a real galaxy redshift survey. We choose to apply 
the t echnique to t he geometry of the 2dF South Galactic 
slice ( |Colless 1995| ). 

First, in section |Tl|, we address this question by making 
use of mock galaxy catalogues drawn from the ten indepen- 
dent biased rC DM iV-body simulations mentioned above. 
Then, in section |3.2| , we perform a similar analysis of ide- 
alized mock catalogues, in which the galaxy density field 
is assumed to be a Gaussian random field. The results are 



compared in section i.3 



3.1 TV-body mock catalogues 

The ten mock 2dF galaxy catalogues that we analyse here 
have many realistic features. They are based on TV-body sim- 
ulations in which the small scale structure has been accu- 
rately evolved into the non-linear regime. The galaxies are 
biased tracers of the underlying dark matter, and have a cor- 
relation function that, on small scales, matches that of the 
galaxies in the APM survey. The 2dF South Galactic slice is 
approximately 90° long in right ascension, with declination 
range 15° centred on 8 = 30°. Our mock catalogues have 
the same geometry on the sky, and are constructed with a 
similar radial selection function to that expected from the 
real survey. The way in which we deal with the wide opening 
angle of the survey and select the galaxy sample to analyse 
is outlined below. 

3.1.1 Distant observer approximation 



The Cartesian linear-theory formalism of Kaiser (1987 ) is 
applicable only when the line-of-sight direction is constant 
across the whole survey. In this case, when the redshift-space 
density field is decomposed into the sum of plane waves, only 
the components of the waves parallel to the line of sight are 



affect ed by the redshift-space distortion (Zaroubi & Hoffman 



199€ ) . Thus it is not directly applicable when the galaxy sur- 



vey subtends a large angle on the sky. Whilst there do exist 
methods of dealing with wide angle surveys in one go, by 



expanding the den sity field into spherical harmonics (Heav- 



ens & Taylor 1995), we do not concern ourselves with those 
here. Rather, we split the survey up into separate angular 
bins with relatively small opening angle, and treat each one 
with the plane parallel approximation. For the 2dF SGP 
geometry, the declination range is 15°. It thus seems rea- 
sonable to pick a right ascension range for the angular bins 
of comparable extent. The width of the South Galactic strip 
is ~ 90° cos 30° = 78° . We split the survey into three bins 
each spanning 26° in right ascension. Note that there are 
pairs of galaxies that span adjacent bins but still have open- 
ing angle less than the 26° bin width. To avoid throwing this 
data away, we also include the two overlapping bins. The ef- 
fect of a finite ope ning angle on the estimated valu e of 
has been studied by Cole, Fisher, & Weinberg (1995). From 
their fig. 8 it is clear that a 26° opening angle should only 
result in a one per cent underestimate of the quadrupole-to- 
monopol e rat io of the power spectrum. Since Qn n (0) is, from 



equation 2.1, approximately linear over a reasonabl e range 



of (esti mates of range from around 0.2 to 1.0 (Hamil 



ton 1998)), this introduces only a minor systematic error. 
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We thus perform the analysis for five bins of angular ex- 
tent « 26° x 15° and average the resulting power spectrum 
estimates. 



3.1.2 Volume limited samples 

The great advantage of surveys with such depth and sam- 
pling as 2dF and SDSS is the ability to construct volume 
limited samples of galaxies. We can thus look at the cluster- 
ing properties of a particular class of galaxies, avoiding the 
problems that arise if bias is a function of luminosity. 

In a volume limited sample, we throw away all the 
galaxies beyond a certain redshift limit, and all galaxies with 
absolute luminosities such that they would not make it into 
the catalogue if they were located at that redshift limit. 

For the genuine survey it will be interesting to carry 
out analysis of redshift-space distortions for a set of volume 
limited catalogues of varying depth. Here we simply illus- 
trate the analysis for one volume limit chosen to be close to 
optimal for determining /3. As the volume limit is increased 
we retain only the brighter galaxies, and so the galaxy num- 
ber density decreases and the shot-noise contribution to the 
power spectrum increases. For an accurate constraint on f3, 
we should choose a depth for which the survey will give reli- 
able data, without errors due to shot noise dominating over 
the contribution due to the finite size of the survey volume. 
For the purposes of illustration, we have chosen this depth 
to be such that the resulting shot-noise contribution to the 
power spectrum is equal to that of the clustered galaxy dis- 
tribution at k — 0.3/i Mpc -1 , so that on larger scales the 
error on P(k) is not shot-noise dominated. The reason for 
this choice of scale is simply that we are going to fit our 
model for the quadrupole distortion only as far as the zero- 
crossing scale fc n i, which, for the model under investigation, 
is fe„i w 0.3/i Mpc -1 . The resulting redshift limit arising from 
this procedure is z ~ 0.29. The median redshift of the survey 
is rather lower than this, z « 0.1. 

At this limit, the survey is very sparse, with n ~ 5 x 
10~ 4 h 3 Mpc -3 . We stress that this sacrifice in density has 
been made in order to probe the longest, most linear scales 
as accurately as possible, and that the resultant high shot 
noise has little or no detrimental effect on measurements of 
the long wavelength, high amplitude modes. 

In catalogues with this depth, we expose ourselves to 
a number of factors which can contaminate the clustering 
signal. The ten mock catalogues considered here are con- 
structed from simulations with fio = 1, Ao = 0. In general, 
the mapping from redshift to comoving distance is a function 
of the cosmology, and so, if the wrong cosmology is assumed 
in this transformation, the size of a volume element will 
appear to change with redshift, essentially mixing the mea- 
sured clustering between scales. This effect is only expected 
to be noticeable for high redshift samples, z > 0.1, and so 
a sample of this depth would require investigation to deter- 
mine the magnitude of potential bias given the uncertainty 
in cosmology. Inherent in having a significant look-back time 
is the possibility for significant evolutionary effects within 
the sample. As the density field evolves under gravitational 
instability, the strength of clustering increases. Conversely, 
if the galaxies are biased tracers of the density field, the 
bias factor approaches unity as evolution progresses (Fry 
1996), so the resultant effect on galaxy clustering is quite 



non-trivial to compute. Furthermore, we should allow for 
the possibility that the galaxies can themselves evolve, so 
a sample of galaxies between certain luminosity bounds at 
high redshift will not necessarily have the same clustering 
properties as a similar sample in the local Universe. 

The interplay between these factors is complex, and we 
make no attempt to deal with any of these effects here, as- 
suming that they are either insignificant, or that they can be 
accurately corrected for in real datasets. In section H we will 
discuss the increase in accuracy that comes about if we deal 
with the whole, magnitude limited catalogue rather than a 
volume limited subsample. 



3.1.3 Estimates of f3 

In order to estimate j3 for each mock catalogue we first esti- 
mate the power spectrum and Q(k) for each catalogue using 
a standard FFT method. To perform the fit we also need 
an estimate of the error on each measurement of Q(k) and 
ideally the covariance of the estimates of Q(k) at different 
values of k. These covariances arise through the finite size 
of the window function, and we estimate the full covariance 
matrix introduced by this effect using the method of multiple 
idealized mock catalogues that will be described in the fol- 
lowing section. We then use this matrix to make generalised 
minimum-x 2 fits of the empirical model of Q(k) to each of 
the ten realizations. This process results in an estimate of j3 
for each of the mock catalogues. 

The true value of j3 in these simulations is (the stars in 
Fig. ||) /Struc = P20 = 0.55, while from the mock catalogues 
we find an average value of /3 av = 0.493, with a scatter of a — 
0.173. First we note that the difference between /3truc and /3 av 
is not statistically significant given that the standard error 
on the mean of the ten realizations is 0.173/VTO = 0.055. 
The scatter on /3, however, is around 35% of the mean; this 
is far in excess of the uncertainty we hope to achieve with a 
survey of the size of 2dF. 



3.2 Idealized linear mock catalogues 

We wish to isolate the reason why the scatter in the esti- 
mated (3 values is substantially larger than expected. To do 
this we have generated a large set of idealized mock cata- 
logues in which the galaxy density is explicitly assumed to 
be a Gaussian random field rather than the result of biasing 
an evolved A-body simulation. 

We construct the density field on a cubic grid by first 
obtaining its Fourier transform, using a simple model for the 
redshift-space power spectrum P s (k, /x) and random phases. 
We choose one axis to be the line-of-sight direction and so as- 
sume that it is constant across the whole survey. This means 
we a void t he problem of large opening-angle discussed in sec- 
and if we wish we can analyse the whole survey 



3.1.1 



tion 

volume in one piece rather than having to slice it into five 
wedges of smaller opening angle. The difference in accuracy 
between these two approaches indicates the scope for im- 
proving the estimates of j3 by carrying out the analysis using 
spherical harmonics, which in principle allow the effects of 
a large opening angle to be treated precisely. 

The redshift-space power spectrum we adopt can be 
expressed as 
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P 3 (k,ix)=P L (k)K( f i)E(k, f i)+P Bhot . 



(3.1) 



Here Pt(k) is the isotropic real space power spectrum which 
we assume to be gi ven by the CDM power spectrum ol 
Bardeen et al. (1986| ): 



P L (k) oc 



k n X [ln(l + 2.34q)/2.34q] : 



1 + 3.89g + (16.1q) 2 + (5.46q) 3 + (6.71q) 4 ] 1 /2 



,(3.2) 



where q = k/T. As with the galaxy power spectrum of 
the A^-body catalogues we set n — 1, F = 0.25, as sug- 



gested by ob servations of large-scale structure (Peacock & 
Dodds 1994) and fix the amplitude such that as = 0.96 



the val ue determined for galaxies in th e APM survey (Mad- 



dox, Estathiou, fc Sutherland 1996). The second factor 



K(fi) = ( 1 + 0ii ) , is t he linear theory redshift-space dis- 



tortion of Kaiser (1987). The effect of small scale velocity 
dispersion is modelled using the pairwise exponential dis- 
persion model by setting E(k, /i) = [1 + (ka v /j,) 2 /2]~ 1 with 
a v — 500km s _1 . This is the dispersion model that comes 
closest to fitting the shape of Q(k) in the TV-body cata- 
logues and the value of a v adopted produces a zero-crossing 
of Q(k) at close to the fc n i ~ 0.3/t Mpc -1 found in those cat- 



alogues. The final term in equation 3.1 represents the shot 
noise in the catalogue and we set P s hot = 1/n, where n is 
the galax y nu mber density in the volume limited catalogues 
of section 



3.1 



This model ensures that the power spectrum of these 
idealized galaxy density fields matches well those of the vol- 
ume limited Af-body mock catalogues analysed above. We 
now give each mode a random complex phase and use an 
FFT to produce the galaxy density field in a cube. This 
field is then sampled using the survey window function. We 
can then either split this volume into five wedges or analyse 
the whole volume in one piece. We can also vary the size of 
the cube in which the density field is constructed to assess 
how this effects the constraint on 0. 



3.2.1 The survey window function 

The effect of sampling the galaxy density field only inside 
certain angular boundaries and with a particular radial se- 
lection function can be modelled by multiplying the full 
galaxy density field, <5(r), by a spatial window function, 
W(r). The effect on the Fourier transform of the density 
field is that of a convolution with the transform VF(k), and 
similarly the power spectrum is convolved with |V^(k)| 2 . 
This process can be thought of as a smoothing of the three- 
dimensional power spectrum, -P(k), with an anisotropic win- 
dow function IW^k)! 2 . On large scales (small k) the effect 
of this is to suppress the quadrupole-to-monopole ratio. In 
fitting our models to the Q(k) measured from the mock sur- 
veys we take account of this by first explicitly convolving the 
model power spectrum with the survey window function. We 
do this numerically using FFTs and the convolution theo- 



3.2.2 Estimates of 

As with the A^-body mock catalogues we estimate Q(k) for 
each realization of an ensemble of mock catalogues. Here we 
can use many more than ten and so determine the mean, 



variance and covariance of the values of Q(k) very accu- 
rately. We use the resultant covariance matrix to perform a 
fit to Q(k) in each realization over the same range of k. In 
this case the model for Q(k) that we fit is that for the ana- 
lytic pairwise exponential model that we used to construct 
the power spectra. Thus we are again in the idealized realm 
of having a completely accurate model for the redshift-space 
distortions, but we have shown that the empirical model is 
indeed a good fit to the TV-body simulation data. Both mod- 
els for Q(k) have two parameters which control the large- 
scale asymptote and the zero-crossing, so it is reasonable to 
assume that confidence limits on derived using one model 
will be similar to those using the other. 

The resulting scatter in when we analyse the ideal- 
ized mock catalogues split into the five wedges as we did 
for the Af-body mock catalogues is 20%. This result is al- 
most a factor of two smaller than the scatter we found in 



section 3.1.3 



If we analyse the whole survey coherently, without split- 
ting it up to satisfy the distant observer approximation, then 
we find a scatter of 16%. This change from 20% to 16% rep- 
resents the modest improvement that can be made if one 
accurately deals with the complications of the effects of the 
large opening angle of the 2dF survey. 

The effect of increasing the simulation box size to the 
point where it is much larger than that used for the N- 
body simulations and much larger than the depth of the 2dF 
survey is to reduce the scatter from 20% to 18%. Thus little 
information is lost due to the finite volume of the TV-body 
simulations used to create the mock catalogues. 

3.3 Discrepancy 

We now address the question of why the mock catalogues 
based on the Af-body simulations produced an error on the 
estimated value of almost twice as large as the idealized 
analysis predicts. 

Fig. U shows the fractional error on P(k) and the abso- 
lute error on Q(k) as a function of wavenumber. It compares 
the variance from the ten independent A'-body mock cata- 
logues with the error we find from the idealized Gaussian 
random fields. The correspondence between the two error 
estimates clearly indicates that the Gaussian random field 
method has not under-predicted the degree of uncertainty 
on the distortion of each individual mode. We also note that 
these error estimates are in good agreement with the values 



predicted by the method of er ror analysis detailed in Feld- 



man, Kaiser, & Peacock 1994. We therefore conclude that 
the reason for the difference in the errors in must be due 
to the existence of significant correlations between Q(k) at 
different values of k. These correlations are over and above 
those that are induced by the shape of the survey window 
function, which we have taken into account in both methods. 
Therefore they must be a result of non-linear gravitational 
evolution and so cannot be modelled by a Gaussian density 
field. 

Gravitational instability on small scales causes coupling 
between different modes of the density field, resul ting in cor- 
relations be tween the phases of these modes ( Meiksin & 
White 199!;). We fail to take into account this non- linear 
effect when using idealized, linear mocks, but it is implicit 
in the Af-body method which has followed the evolution of 
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Figure 5. The fractional error on the power spectrum (upper 
panel) and absolute error on the quadrupole-to-monopole ratio 
(lower panel) . The solid line is from the variance between then ten 
independent JV-body mock catalogue realizations of the survey, 
the dotte d lin e is from one hundred 'linear' mocks as described 



in section 3.2. We have attached Poisson error bars to the points 



using the relation that the standard deviation of the scatter is 
given by cr ss = a s / ^ /2(N — 1). wh ere cr a 
from N realizations (Barlow 1989). 



is the measured scatter 



these modes accurately. We only expect linear theory to be 
valid for A 2 (k) < 1, where A 2 (fc) = 47r/(27r) 3 fc 3 P(fc), and 
this condition is certainly not met in our A^-body simula- 
tions around the zero-crossing of the quadrupole. However, 
the strength of the effect is perhaps initially surprising. 

As a first attempt to deal with these non-lin earities, we 
repeat the TV-body fits described in section 3.1.3, but replace 



the covariance matrix used previously with a new matrix 
that is obtained from the ten realizations themselves. The 
off-diagonal terms in this matrix arise from mode coupling 
caused by the survey window function and also non-linear 
evolution of the density field. The error in the estimate is 
now reduced from 35% to 25%. This reduction occurs be- 
cause previously we attached much weight to the quasi-linear 
regime, to modes at intermediate wavelengths close to, but 
longer than, k n l, where the quality of data appeared good. 
In actual fact, this regime is also the scene of strong non- 
linear mode coupling, so the data are not in fact as good as 
previously believed. Applying the more realistic weighting 
scheme makes better use of the data, and hence produces 
less scatter in the estimate of (3. Unfortunately, this method 
of obtaining the covariance matrix is time-consuming, since 
it involves producing multiple large- volume iV-body simula- 



tions for each cosmology and power spectrum that is to be 
studied. 



3.4 Summary 

In sections 2 and 3, we have: 

• used biased N-ho&y simulations to develop an accurate, 
empirical model for Q(k). 

• used mock catalogues drawn from these simulations to 
illustrate how accurately (5 can be measured in practice. 

• used linear mock catalogues to show how much of the 
uncertainty in /3 comes from just the finite volume effect. 

• demonstrated that, in fact, a major source of random 
error is the coupling of non-linear modes. 

• shown that allowing for this coupling (by using a covari- 
ance matrix derived from the full mock catalogues, rather 
than the linear mocks) results in a large improvement in 
constraining /3. 

The error has been reduced by allowing for non-linear 
effects, but is still rather large, in excess of the level of con- 
straint that could be achieved if non-linearities were not 
present at all. In the next section, we outline a practical 
method for reducing the non-linearities and thus increasing 
the accuracy of the es timates. It is also worth mentioning the 
approach adopted by Hamilton (1999), who finds a method 



for constructing a near minimum variance estimator for the 
non-linear power spectrum based on assembling a set of al- 
most uncorrelated non-linear modes. The use of such opti- 
mal methods in both the non-linear and linear regimes may 
provide the capability to constrain j3 rather better than we 
do in this work, although the methods are still in the de- 
velopment stage and it is not clear how easily they can be 
applied to a real dataset. 



4 COLLAPSING CLUSTERS 

In the dispersion models one assumes that the random ve- 
locities induced by non-linearities are uniform and uncor- 
related with the density field. In reality we do not expect 
this to be the case; the coupling between fe-modes is a local 
effect in r-space, taking place inside particular non-linear 
structures. In these volumes, mode coupling is extremely 
strong. As pointed out in HC98, if we can somehow excise 
these regions from our treatment, we could create a bet- 
ter behaved, more linear quadrupole-to-monopole ratio. In 
addition, this would have an even more dramatic effect in 
lessening the correlations between modes on these scales. 
This result could be achieved by identifying and removing 
the signal from clusters, as described in HC98. 

Here we illustrate the application of this method. Tak- 
ing the ten simulations used earlier, we identify clust ers in 
real space using a friends-of- friends (Davis et al. 1985) algo- 



rithm with linking length 0.2 times the mean inter-particle 
separation. We select clusters with ten or more members, 
and compute the mean velocity of each cluster by averag- 
ing over the velocities of its members. The cluster members 
then have their velocities in the simulation replaced with 
that of their parent cluster. In this way, it was shown in 
HC98, the effect of non-linear velocity dispersions can be 
heavily damped. 
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In a real survey, clusters will have to be measured with- 
out the perfect knowledge of galaxy density field which we 
have assumed here, and in redshift space rather than real 
space. This is likely to involve a number of parameters and 
considerations, and from this point of view it seems pointless 
to develop an accurate model for the cluster-collapsed Q(k) 
since this will not be generally applicable. Here we instead 
take as a model for Q(k) the shape measured from averaging 
the ten full-cube simulations. We multiply this by a single 
free parameter, Qu n , which we assume is related to (5 via 
the linear theory result (equation ^.l| ). For the set of rCDM 
simulations, non-linearities are suppressed to the extent that 
the quadrupole now crosses the fc-axis at fe n i = 0.8/iMpc . 

Having derived this model from the simulations, we now 
apply it to mock catalogues. These are created from the 
cluster-collapsed simulations, with the same parameters and 
selection functions as the mock 2dF SGP catalogues used in 
section |^. We then apply the same FFT technique explained 
above to measure Q(k) in the mock catalogues. 

We perform the fits over the same range of k as before 
(0 < k < 0.3/i Mpc -1 ), using a covariance matrix obtained 
from the ten independent cluster-collapsed mock catalogues. 
We find that the scatter in j3 is reduced from 25% to 13%. 
This is better than the result from the Gaussian random 
fields and indicates that collapsing clusters has successfully 
removed most of the mode correlations on these scales. Col- 
lapsing the clusters has pushed the zero-crossing of Q(k) to 
smaller scales, so we may be able to reduce the uncertainty 
still further by extending the range over which we fit the 
model. However, this is not the case; extending the fits to 
k — O.6/1 Mpc -1 results in no significant increase in accu- 
racy. This is because the data in this range is much noisier 
than in the uncollapsed case, aiding little in fitting the the 
curve. 

We note that now we are using scales that were previ- 
ously quasi-linear to determine /3, so the fits will not be 
greatly improved by relaxing the small angle constraint. 
Much of the new data is coming from pairs whose sepa- 
ration on the sky is small compared to the full extent of the 
survey. 



5 A MAGNITUDE LIMITED SAMPLE 

Throughout the above work we have used the same volume 
limited sample with zu m = 0.29 to measure /3. Volume limit- 
ing selects a subsample of galaxies, which can be very useful 
for studying clustering behaviour as a function of luminos- 
ity, but at the expense of discarding the majority of galaxies 
and hence severely reducing the number density of objects. 
If we are interested in using the survey to measure j3 as well 
as possible for all the galaxies in the catalogue, we can ex- 
tend the treatment to the magnitude limited case. For the 
case of a real survey, this must be done with caution, since 
we know that galaxies of different luminosity exhibit differ- 



modes. We weight galaxies using t he scheme introduced by 



Feldman, Kaiser, & Peacock (1994) 



ent clustering properties (eg. Loveday, Tresse, & Maddox 



199£). In our mock catalogues, however, there is no depen- 



dence of clustering strength on galaxy luminosity, so any 
sample should measure the same power spectrum and hence 
the same /3. A magnitude limited sample will be more accu- 
rate since the number of galaxies is higher, and it includes 
galaxies at higher redshifts, probing more long-wavelength 



w(r) 



1 +n(r)P w 



(5.1) 



where n(r) is the number density at position r, and we set 
= 5000/2 3 Mpc -3 , roughly reflecting the value of the 
galaxy power spectrum around the middle of the range over 
which the fit is performed (k ~ 0.15/i Mpc -1 ). This weight- 
ing is designed to minimize the variance in estimates of P(k) 
where P(k) ~ P w . 

Using this method, applied to the cluster-collapsed cat- 
alogues described in the previous section, results in a reduc- 
tion of the error on (3 from 13% to 10%. A similar improve- 
ment is expected for the uncollapsed case. 



6 CONCLUSIONS 

There are four distinct results presented in this paper: 

(i) We have found an empirical model for the quadrupole 
moment, Q(k), of the redshift-space power spectrum which 
matches the behaviour of this quantity in a wide range of 
cosmological models much more accurately than existing an- 
alytic models. We have demonstrated that it can be success- 
fully used to obtain unbiased estimates of the redshift-space 
distortion parameter (3 = Q 6 /b. 

(ii) We have shown that the resultant measurement of j3 
is better correlated with the large-scale bias factor than the 
usual quasi-linear measure from spheres of radius 8/1 -1 Mpc. 
The quadrupole is thus not sensitive to small-scale scale- 
dependence of the bias. 

(iii) We have demonstrated that mode coupling caused by 
non-linear gravitational evolution causes strong correlations 
between the measured values of Q(k) at different values of k, 
even on quite large scales. This implies that estimates of the 
accuracy to which f3 can be constrained that are based on 
assuming the galaxy density field is Gaussian significantly 
under estimate this error. 

(iv) The most non-linear regions of the density field are 
galaxy clusters. Although these are compact structures in 
real-space they are very extended in redshift space and so 
can affect quite large scale modes in the redshift-space den- 
sity field. We have shown that by identifying and collapsing 
galaxy clusters we are broadly successful at both reducing 
the correlation and increasing the range of scales over which 
Q(k) can be reliably fitted. 

For a set of 2dF southern slice mock catalogues which 
have a true value of (3 = 0.55, our empirical model of red- 
shift space distortion produces unbiased estimates of j3 with 
a 25% statistical error when measured from a volume lim- 
ited sample with limiting redshift zn m = 0.29. This model 
is applicable to the raw, redshift-space galaxy distribution 
and does not require the identification or manipulation of 
galaxy clusters. We estimate the statistical error will be sig- 
nificantly reduced (to around 20%) if the survey is analysed 
in one piece, rather than being chopped into five overlap- 
ping wedges, which we do in order to satisfy a small angle 
constraint. 

If clusters can be successfully identified and collapsed in 
the 2dF galaxy redshift survey, analysis of the main south- 
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ern slice should constrain (3 to a greater accuracy of 13% 
(volume limited) and 10% (magnitude limited). This error 
is not expected to be significantly reduced by relaxing the 
constraint of the distant observer approximation and using 
galaxy pairs at large angular separation, since much more 
of the information now comes from pairs that are physically 
nearer each other, and hence closer together on the sky. 

It is worthwhile noting that TV-body simulations, when 
normalized to replicate the observed abundance of galaxy 
clusters, produce pairwise dark-matter velocity dispersions 
whi ch are significantly higher than those measured for galax- 
ies (Jenkins et al. 1998). This problem is largely alleviated in 



new models which attempt to realistically identify the places 
where galaxies for m, either by numeri cal iV-body hydrody- 
namic simu lation (Jenkins et al. 1999 ), or by semi-analytic 
modelling (Benson et al. 1999a). However, the simple bias 
models used in this paper produce pairwise galaxy velocity 
dispersions which are close to those of the dark matter and 
therefore somewhat higher than for observed galaxies. It is 
to be hoped that this fact alone will render observational 
datasets somewhat less prone to non-linear effects than we 
estimate from our simulations, and the 10% statistical error 
we find may be a little conservative. 

The 2dF survey also contains a slice in the northern 
Galactic hemisphere. This is a little smaller than the south- 
ern slice, but measuring (3 from this data and combining 
it with the southern slice provides an additional constraint. 
Assuming the errors add in quadrature, the uncertainty is 
reduced by a factor « 1.25. 

We thus conclude that the quadrupole moment of the 
redshift-space distortion measured in the 2dF survey will be 
capable of constraining f3 to an accuracy of 8%. This will 
yield an important constraint on the density parameter SIq . 
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